Fast Approximate Duplicate Detection for 2D-NMR Spectra

نویسندگان

  • Björn Egert
  • Steffen Neumann
  • Alexander Hinneburg
چکیده

2D-Nuclear magnetic resonance (NMR) spectroscopy is a powerful analytical method to elucidate the chemical structure of molecules. In contrast to 1D-NMR spectra, 2D-NMR spectra correlate the chemical shifts of H and C simultaneously. To curate or merge large spectra libraries a robust (and fast) duplicate detection is needed. We propose a definition of duplicates with the desired robustness properties mandatory for 2D-NMR experiments. A major gain in runtime performance wrt. previously proposed heuristics is achieved by mapping the spectra to simple discrete objects. We propose several appropriate data transformations for this task. In order to compensate for slight variations of the mapped spectra, we use appropriate hashing functions according to the locality sensitive hashing scheme, and identify duplicates by hashcollisions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Duplicate detection of 2D-NMR Spectra

2D-Nuclear magnetic resonance (NMR) spectra are used in the (structural) analysis of small molecules. In contrast to 1D-NMR spectra, 2D-NMR spectra correlate the chemical shifts of 1H and 13C at the same time. A spectrum consists of several peaks in a twodimensional space. The most important information of a peak is the location of its center, which captures the bonding relationships of hydroge...

متن کامل

Rapid acquisition of wideline MAS solid-state NMR spectra with fast MAS, proton detection, and dipolar HMQC pulse sequences.

The solid-state NMR spectra of many NMR active elements are often extremely broad due to the presence of chemical shift anisotropy (CSA) and/or the quadrupolar interaction (for nuclei with spin I > 1/2). These NMR interactions often give rise to wideline solid-state NMR spectra which can span hundreds of kHz or several MHz. Here we demonstrate that by using fast MAS, proton detection and dipola...

متن کامل

An improved ultrafast 2D NMR experiment: towards atom-resolved real-time studies of protein kinetics at multi-Hz rates.

Multidimensional NMR spectroscopy is a well-established technique for the characterization of structure and fast-time-scale dynamics of highly populated ground states of biological macromolecules. The investigation of short-lived excited states that are important for molecular folding, misfolding and function, however, remains a challenge for modern biomolecular NMR techniques. Off-equilibrium ...

متن کامل

UltraSOFAST HMQC NMR and the repetitive acquisition of 2D protein spectra at Hz rates.

Following unidirectional biophysical events such as the folding of proteins or the equilibration of binding interactions, requires experimental methods that yield information at both atomic-level resolution and at high repetition rates. Toward this end a number of different approaches enabling the rapid acquisition of 2D NMR spectra have been recently introduced, including spatially encoded "ul...

متن کامل

NMR and vibrational spectra of 2-methoxycarbonyl-7-methyl-1,3-thiazino[3,2- b][1,2,4]triazine-4,8-dione: a joint of experimental and DFT

The IR and NMR spectra were coupled with quantum chemical calculations in DFT approach usingthe hybrid B3LYP exchange-correlation functional to confirm the structure of 2-methoxycarbonyl-7-methyl-1,3-thiazino[3,2-b][1,2,4]triazine-4,8-dione 2d.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007